Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 3652 |
| Missing cells | 597 |
| Missing cells (%) | 0.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 513.7 KiB |
| Average record size in memory | 144.0 B |
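The percentage and per-record figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming that "Missing cells (%)" is missing cells divided by variables times observations, and that 1 KiB is 1024 bytes (the variable names are illustrative, not part of the report):

```python
# Figures copied from the Dataset statistics table.
n_variables = 18
n_observations = 3652
missing_cells = 597
total_size_kib = 513.7

# Assumption: the percentage is taken over all cells in the table.
total_cells = n_variables * n_observations
missing_share = missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_share:.1%}")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both derived values round to the figures shown in the table.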
Variable types
| NUM | 13 |
|---|---|
| CAT | 3 |
| BOOL | 2 |
Warnings
| Warning | Type |
|---|---|
| Temperature_Max is highly correlated with Temperature_Midday and 2 other fields | High correlation |
| Temperature_Midday is highly correlated with Temperature_Max and 2 other fields | High correlation |
| Temperature_Min is highly correlated with Temperature_Midday and 2 other fields | High correlation |
| Temperature_Evening is highly correlated with Temperature_Midday and 2 other fields | High correlation |
| Special_Event is highly correlated with Holiday | High correlation |
| Holiday is highly correlated with Special_Event | High correlation |
| Snow_5Days has 565 (15.5%) missing values | Missing |
| Day is uniformly distributed | Uniform |
| Sunshine_Percentage has 807 (22.1%) zeros | Zeros |
| Snow_5Days has 2793 (76.5%) zeros | Zeros |
| Temperature_Deviation has 43 (1.2%) zeros | Zeros |
| Precipiation_5Days has 511 (14.0%) zeros | Zeros |
| Precipiation has 1876 (51.4%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-21 11:19:44.267315 |
|---|---|
| Analysis finished | 2020-09-21 11:20:14.190022 |
| Duration | 29.92 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
Passengers
Real number (ℝ≥0)
| Distinct | 3045 |
|---|---|
| Distinct (%) | 83.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6387.8023 |
|---|---|
| Minimum | 0 |
| Maximum | 34878 |
| Zeros | 33 |
| Zeros (%) | 0.9% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 825 |
| Q1 | 1547 |
| median | 4029 |
| Q3 | 9771 |
| 95-th percentile | 18593.95 |
| Maximum | 34878 |
| Range | 34878 |
| Interquartile range (IQR) | 8224 |
Descriptive statistics
| Standard deviation | 6033.488385 |
|---|---|
| Coefficient of variation (CV) | 0.9445327362 |
| Kurtosis | 0.9687784571 |
| Mean | 6387.8023 |
| Median Absolute Deviation (MAD) | 2909 |
| Skewness | 1.236183308 |
| Sum | 23328254 |
| Variance | 36402982.09 |
| Monotonicity | Not monotonic |
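Several of the Passengers statistics are simple functions of the others. The sketch below re-derives them from the reported values as a consistency check; it assumes the usual definitions (IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared) and is not taken from the profiling tool's source.

```python
# Values copied from the Passengers tables.
mean = 6387.8023
std = 6033.488385
q1, q3 = 1547, 9771
minimum, maximum = 0, 34878
total, n_observations = 23328254, 3652

iqr = q3 - q1                          # interquartile range
value_range = maximum - minimum        # range
cv = std / mean                        # coefficient of variation
variance = std ** 2                    # variance from the standard deviation
mean_from_sum = total / n_observations # mean recomputed from the sum

print(f"IQR: {iqr}")
print(f"Range: {value_range}")
print(f"CV: {cv:.6f}")
print(f"Variance: {variance:.2f}")
print(f"Mean from sum: {mean_from_sum:.4f}")
```

A CV close to 1 together with a skewness above 1 suggests a wide, right-skewed distribution of daily passenger counts.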
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33 | 0.9% | |
| 1010 | 5 | 0.1% | |
| 1525 | 5 | 0.1% | |
| 1625 | 5 | 0.1% | |
| 1241 | 5 | 0.1% | |
| 1057 | 4 | 0.1% | |
| 939 | 4 | 0.1% | |
| 1623 | 4 | 0.1% | |
| 902 | 4 | 0.1% | |
| 1418 | 4 | 0.1% | |
| Other values (3035) | 3579 | 98.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33 | 0.9% | |
| 24 | 1 | < 0.1% | |
| 54 | 1 | < 0.1% | |
| 523 | 1 | < 0.1% | |
| 544 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 34878 | 1 | < 0.1% | |
| 31326 | 1 | < 0.1% | |
| 30549 | 1 | < 0.1% | |
| 30415 | 1 | < 0.1% | |
| 30174 | 1 | < 0.1% |
Revision
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.5 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3617 | 99.0% | |
| 1 | 35 | 1.0% |
Temperature_Midday
Real number (ℝ)
| Distinct | 383 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.84942497 |
|---|---|
| Minimum | -11.2 |
| Maximum | 32.7 |
| Zeros | 6 |
| Zeros (%) | 0.2% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | -11.2 |
|---|---|
| 5-th percentile | -0.9 |
| Q1 | 6 |
| median | 13.3 |
| Q3 | 19.5 |
| 95-th percentile | 26.4 |
| Maximum | 32.7 |
| Range | 43.9 |
| Interquartile range (IQR) | 13.5 |
Descriptive statistics
| Standard deviation | 8.509758988 |
|---|---|
| Coefficient of variation (CV) | 0.6622676895 |
| Kurtosis | -0.8275978816 |
| Mean | 12.84942497 |
| Median Absolute Deviation (MAD) | 6.7 |
| Skewness | -0.04654652102 |
| Sum | 46926.1 |
| Variance | 72.41599803 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.4 | 23 | 0.6% | |
| 19.5 | 23 | 0.6% | |
| 19 | 22 | 0.6% | |
| 18.5 | 22 | 0.6% | |
| 19.2 | 21 | 0.6% | |
| 16.7 | 21 | 0.6% | |
| 10.2 | 21 | 0.6% | |
| 15.4 | 21 | 0.6% | |
| 16.9 | 21 | 0.6% | |
| 14.3 | 21 | 0.6% | |
| Other values (373) | 3436 | 94.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -11.2 | 1 | < 0.1% | |
| -11 | 1 | < 0.1% | |
| -10.8 | 1 | < 0.1% | |
| -10 | 1 | < 0.1% | |
| -9.6 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 32.7 | 1 | < 0.1% | |
| 32.5 | 2 | 0.1% | |
| 32.3 | 1 | < 0.1% | |
| 32.2 | 1 | < 0.1% | |
| 32 | 2 | 0.1% |
Sunshine_Percentage
Real number (ℝ≥0)
| Distinct | 103 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.03216053 |
|---|---|
| Minimum | 0 |
| Maximum | 101 |
| Zeros | 807 |
| Zeros (%) | 22.1% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.75 |
| median | 28 |
| Q3 | 68 |
| 95-th percentile | 98 |
| Maximum | 101 |
| Range | 101 |
| Interquartile range (IQR) | 66.25 |
Descriptive statistics
| Standard deviation | 34.98270198 |
|---|---|
| Coefficient of variation (CV) | 0.9446573324 |
| Kurtosis | -1.221677876 |
| Mean | 37.03216053 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 0.4849864348 |
| Sum | 135241.4502 |
| Variance | 1223.789438 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 807 | 22.1% | |
| 1 | 106 | 2.9% | |
| 100 | 89 | 2.4% | |
| 99 | 79 | 2.2% | |
| 3 | 64 | 1.8% | |
| 2 | 55 | 1.5% | |
| 98 | 48 | 1.3% | |
| 97 | 46 | 1.3% | |
| 14 | 45 | 1.2% | |
| 4 | 43 | 1.2% | |
| Other values (93) | 2270 | 62.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 807 | 22.1% | |
| 1 | 106 | 2.9% | |
| 2 | 55 | 1.5% | |
| 3 | 64 | 1.8% | |
| 4 | 43 | 1.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 101 | 2 | 0.1% | |
| 100 | 89 | 2.4% | |
| 99 | 79 | 2.2% | |
| 98 | 48 | 1.3% | |
| 97 | 46 | 1.3% |
Snow_5Days
Real number (ℝ≥0)
| Distinct | 32 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 565 |
| Missing (%) | 15.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7884677681 |
|---|---|
| Minimum | 0 |
| Maximum | 36 |
| Zeros | 2793 |
| Zeros (%) | 76.5% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.379960264 |
|---|---|
| Coefficient of variation (CV) | 4.286745002 |
| Kurtosis | 42.86467956 |
| Mean | 0.7884677681 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.047057175 |
| Sum | 2434 |
| Variance | 11.42413138 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2793 | 76.5% | |
| 2 | 37 | 1.0% | |
| 5 | 36 | 1.0% | |
| 3 | 26 | 0.7% | |
| 1 | 25 | 0.7% | |
| 4 | 24 | 0.7% | |
| 10 | 19 | 0.5% | |
| 6 | 18 | 0.5% | |
| 9 | 17 | 0.5% | |
| 8 | 15 | 0.4% | |
| Other values (22) | 77 | 2.1% | |
| (Missing) | 565 | 15.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2793 | 76.5% | |
| 1 | 25 | 0.7% | |
| 2 | 37 | 1.0% | |
| 3 | 26 | 0.7% | |
| 4 | 24 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 36 | 1 | < 0.1% | |
| 35 | 2 | 0.1% | |
| 34 | 2 | 0.1% | |
| 32 | 2 | 0.1% | |
| 30 | 4 | 0.1% |
Temperature_Deviation
Real number (ℝ)
| Distinct | 188 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.153313253 |
|---|---|
| Minimum | -12 |
| Maximum | 14.6 |
| Zeros | 43 |
| Zeros (%) | 1.2% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | -12 |
|---|---|
| 5-th percentile | -4.345 |
| Q1 | -1.1 |
| median | 1.3 |
| Q3 | 3.5 |
| 95-th percentile | 6.5 |
| Maximum | 14.6 |
| Range | 26.6 |
| Interquartile range (IQR) | 4.6 |
Descriptive statistics
| Standard deviation | 3.352714264 |
|---|---|
| Coefficient of variation (CV) | 2.907028299 |
| Kurtosis | 0.0291209036 |
| Mean | 1.153313253 |
| Median Absolute Deviation (MAD) | 2.3 |
| Skewness | -0.1429645275 |
| Sum | 4211.9 |
| Variance | 11.24069294 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 53 | 1.5% | |
| 1.6 | 51 | 1.4% | |
| 2.1 | 49 | 1.3% | |
| 3.3 | 49 | 1.3% | |
| 3 | 49 | 1.3% | |
| 1.1 | 48 | 1.3% | |
| 2.6 | 47 | 1.3% | |
| 3.4 | 46 | 1.3% | |
| 2.9 | 45 | 1.2% | |
| 0.2 | 45 | 1.2% | |
| Other values (178) | 3170 | 86.8% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -12 | 1 | < 0.1% | |
| -11.7 | 1 | < 0.1% | |
| -11.6 | 1 | < 0.1% | |
| -11.3 | 1 | < 0.1% | |
| -10.8 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 14.6 | 2 | 0.1% | |
| 11.3 | 1 | < 0.1% | |
| 11.2 | 2 | 0.1% | |
| 11 | 1 | < 0.1% | |
| 10.9 | 2 | 0.1% |
Temperature_Max
Real number (ℝ)
| Distinct | 391 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.61771632 |
|---|---|
| Minimum | -9.3 |
| Maximum | 34.9 |
| Zeros | 4 |
| Zeros (%) | 0.1% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | -9.3 |
|---|---|
| 5-th percentile | 0.2 |
| Q1 | 7.5 |
| median | 15.1 |
| Q3 | 21.5 |
| 95-th percentile | 28.3 |
| Maximum | 34.9 |
| Range | 44.2 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.801185208 |
|---|---|
| Coefficient of variation (CV) | 0.6020903003 |
| Kurtosis | -0.8390175138 |
| Mean | 14.61771632 |
| Median Absolute Deviation (MAD) | 6.9 |
| Skewness | -0.07894840193 |
| Sum | 53383.9 |
| Variance | 77.46086107 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 26 | 0.7% | |
| 21.8 | 23 | 0.6% | |
| 10.4 | 22 | 0.6% | |
| 20.1 | 22 | 0.6% | |
| 18.6 | 21 | 0.6% | |
| 17.3 | 21 | 0.6% | |
| 19.5 | 21 | 0.6% | |
| 11.1 | 21 | 0.6% | |
| 24 | 21 | 0.6% | |
| 24.5 | 21 | 0.6% | |
| Other values (381) | 3433 | 94.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -9.3 | 1 | < 0.1% | |
| -9.1 | 1 | < 0.1% | |
| -9 | 1 | < 0.1% | |
| -8.7 | 1 | < 0.1% | |
| -8.5 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 34.9 | 1 | < 0.1% | |
| 34.8 | 1 | < 0.1% | |
| 34.6 | 1 | < 0.1% | |
| 34.5 | 2 | 0.1% | |
| 34.3 | 1 | < 0.1% |
Temperature_Min
Real number (ℝ)
| Distinct | 304 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.212568456 |
|---|---|
| Minimum | -16.1 |
| Maximum | 21.5 |
| Zeros | 15 |
| Zeros (%) | 0.4% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | -16.1 |
|---|---|
| 5-th percentile | -4.4 |
| Q1 | 0.6 |
| median | 6.4 |
| Q3 | 11.8 |
| 95-th percentile | 16.5 |
| Maximum | 21.5 |
| Range | 37.6 |
| Interquartile range (IQR) | 11.2 |
Descriptive statistics
| Standard deviation | 6.729163507 |
|---|---|
| Coefficient of variation (CV) | 1.083153217 |
| Kurtosis | -0.8913467671 |
| Mean | 6.212568456 |
| Median Absolute Deviation (MAD) | 5.6 |
| Skewness | -0.1184410974 |
| Sum | 22688.3 |
| Variance | 45.2816415 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.9 | 32 | 0.9% | |
| 0.5 | 30 | 0.8% | |
| 3.8 | 28 | 0.8% | |
| 8.4 | 27 | 0.7% | |
| 2.5 | 27 | 0.7% | |
| -0.3 | 26 | 0.7% | |
| -0.2 | 26 | 0.7% | |
| 0.6 | 25 | 0.7% | |
| 10.3 | 23 | 0.6% | |
| 3 | 23 | 0.6% | |
| Other values (294) | 3385 | 92.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -16.1 | 1 | < 0.1% | |
| -13.8 | 1 | < 0.1% | |
| -13.4 | 1 | < 0.1% | |
| -12.7 | 1 | < 0.1% | |
| -12.6 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 21.5 | 1 | < 0.1% | |
| 20.9 | 1 | < 0.1% | |
| 20.5 | 1 | < 0.1% | |
| 20.4 | 1 | < 0.1% | |
| 19.9 | 1 | < 0.1% |
Temperature_Evening
Real number (ℝ)
| Distinct | 377 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.55603944 |
|---|---|
| Minimum | -10.7 |
| Maximum | 32.7 |
| Zeros | 13 |
| Zeros (%) | 0.4% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | -10.7 |
|---|---|
| 5-th percentile | -1.345 |
| Q1 | 4.6 |
| median | 11.9 |
| Q3 | 18.1 |
| 95-th percentile | 24.9 |
| Maximum | 32.7 |
| Range | 43.4 |
| Interquartile range (IQR) | 13.5 |
Descriptive statistics
| Standard deviation | 8.341949876 |
|---|---|
| Coefficient of variation (CV) | 0.7218692804 |
| Kurtosis | -0.8752155081 |
| Mean | 11.55603944 |
| Median Absolute Deviation (MAD) | 6.8 |
| Skewness | 0.015468861 |
| Sum | 42202.65604 |
| Variance | 69.58812773 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 23 | 0.6% | |
| 3.7 | 22 | 0.6% | |
| 3.3 | 22 | 0.6% | |
| 6.7 | 21 | 0.6% | |
| 8.5 | 20 | 0.5% | |
| 17.2 | 20 | 0.5% | |
| 14.1 | 20 | 0.5% | |
| 5.6 | 20 | 0.5% | |
| 15.4 | 20 | 0.5% | |
| 1.1 | 20 | 0.5% | |
| Other values (367) | 3444 | 94.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -10.7 | 1 | < 0.1% | |
| -10.1 | 1 | < 0.1% | |
| -9.6 | 2 | 0.1% | |
| -9.5 | 1 | < 0.1% | |
| -9.2 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 32.7 | 1 | < 0.1% | |
| 32 | 1 | < 0.1% | |
| 31.6 | 1 | < 0.1% | |
| 31.5 | 2 | 0.1% | |
| 31.4 | 1 | < 0.1% |
Precipiation_5Days
Real number (ℝ≥0)
| Distinct | 629 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.77343921 |
|---|---|
| Minimum | 0 |
| Maximum | 113.7 |
| Zeros | 511 |
| Zeros (%) | 14.0% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.7 |
| median | 10.5 |
| Q3 | 24.9 |
| 95-th percentile | 54.8 |
| Maximum | 113.7 |
| Range | 113.7 |
| Interquartile range (IQR) | 23.2 |
Descriptive statistics
| Standard deviation | 19.07172298 |
|---|---|
| Coefficient of variation (CV) | 1.137019232 |
| Kurtosis | 3.219842168 |
| Mean | 16.77343921 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.682689055 |
| Sum | 61256.6 |
| Variance | 363.7306173 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 511 | 14.0% | |
| 0.2 | 66 | 1.8% | |
| 0.1 | 45 | 1.2% | |
| 0.3 | 35 | 1.0% | |
| 0.5 | 26 | 0.7% | |
| 1.4 | 25 | 0.7% | |
| 0.8 | 25 | 0.7% | |
| 0.4 | 24 | 0.7% | |
| 7.1 | 24 | 0.7% | |
| 1.1 | 24 | 0.7% | |
| Other values (619) | 2847 | 78.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 511 | 14.0% | |
| 0.1 | 45 | 1.2% | |
| 0.2 | 66 | 1.8% | |
| 0.3 | 35 | 1.0% | |
| 0.4 | 24 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 113.7 | 1 | < 0.1% | |
| 113.2 | 1 | < 0.1% | |
| 113 | 1 | < 0.1% | |
| 112.4 | 1 | < 0.1% | |
| 110.1 | 1 | < 0.1% |
Precipiation
Real number (ℝ≥0)
| Distinct | 301 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 13 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.347513053 |
|---|---|
| Minimum | 0 |
| Maximum | 70.2 |
| Zeros | 1876 |
| Zeros (%) | 51.4% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3.1 |
| 95-th percentile | 17.61 |
| Maximum | 70.2 |
| Range | 70.2 |
| Interquartile range (IQR) | 3.1 |
Descriptive statistics
| Standard deviation | 7.172589902 |
|---|---|
| Coefficient of variation (CV) | 2.142662266 |
| Kurtosis | 15.86812738 |
| Mean | 3.347513053 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.465247569 |
| Sum | 12181.6 |
| Variance | 51.4460459 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1876 | 51.4% | |
| 0.1 | 124 | 3.4% | |
| 0.2 | 81 | 2.2% | |
| 0.3 | 60 | 1.6% | |
| 0.5 | 47 | 1.3% | |
| 0.4 | 42 | 1.2% | |
| 0.6 | 35 | 1.0% | |
| 0.8 | 29 | 0.8% | |
| 1.2 | 29 | 0.8% | |
| 0.9 | 29 | 0.8% | |
| Other values (291) | 1287 | 35.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1876 | 51.4% | |
| 0.1 | 124 | 3.4% | |
| 0.2 | 81 | 2.2% | |
| 0.3 | 60 | 1.6% | |
| 0.4 | 42 | 1.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 70.2 | 1 | < 0.1% | |
| 67.9 | 1 | < 0.1% | |
| 63.6 | 1 | < 0.1% | |
| 58.2 | 1 | < 0.1% | |
| 57.8 | 1 | < 0.1% |
Wind
Real number (ℝ≥0)
| Distinct | 245 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 19 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.016350124 |
|---|---|
| Minimum | 2.2 |
| Maximum | 31.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 2.2 |
|---|---|
| 5-th percentile | 3.6 |
| Q1 | 4.8 |
| median | 6.4 |
| Q3 | 9.8 |
| 95-th percentile | 17.8 |
| Maximum | 31.1 |
| Range | 28.9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.659221494 |
|---|---|
| Coefficient of variation (CV) | 0.58121482 |
| Kurtosis | 3.290420095 |
| Mean | 8.016350124 |
| Median Absolute Deviation (MAD) | 1.9 |
| Skewness | 1.738259749 |
| Sum | 29123.4 |
| Variance | 21.70834493 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5.3 | 85 | 2.3% | |
| 4.8 | 83 | 2.3% | |
| 5.5 | 83 | 2.3% | |
| 5.8 | 74 | 2.0% | |
| 4.7 | 73 | 2.0% | |
| 4.6 | 67 | 1.8% | |
| 6.3 | 67 | 1.8% | |
| 3.8 | 64 | 1.8% | |
| 5.2 | 63 | 1.7% | |
| 4.5 | 62 | 1.7% | |
| Other values (235) | 2912 | 79.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.2 | 2 | 0.1% | |
| 2.3 | 2 | 0.1% | |
| 2.4 | 1 | < 0.1% | |
| 2.5 | 7 | 0.2% | |
| 2.6 | 8 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 31.1 | 1 | < 0.1% | |
| 30.5 | 1 | < 0.1% | |
| 30.3 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 29.1 | 2 | 0.1% |
Holiday
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.5 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3502 | 95.9% | |
| 1 | 149 | 4.1% | |
| 6 | 1 | < 0.1% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Special_Event
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.5 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3502 | 95.9% | |
| 1 | 150 | 4.1% |
Year
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.499726 |
|---|---|
| Minimum | 2006 |
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 2006 |
|---|---|
| 5-th percentile | 2006 |
| Q1 | 2008 |
| median | 2010.5 |
| Q3 | 2013 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.87229323 |
|---|---|
| Coefficient of variation (CV) | 0.001428646417 |
| Kurtosis | -1.224127228 |
| Mean | 2010.499726 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 0.0001444845368 |
| Sum | 7342345 |
| Variance | 8.250068399 |
| Monotonicity | Increasing |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2012 | 366 | 10.0% | |
| 2008 | 366 | 10.0% | |
| 2015 | 365 | 10.0% | |
| 2013 | 365 | 10.0% | |
| 2011 | 365 | 10.0% | |
| 2009 | 365 | 10.0% | |
| 2007 | 365 | 10.0% | |
| 2014 | 365 | 10.0% | |
| 2010 | 365 | 10.0% | |
| 2006 | 365 | 10.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2006 | 365 | 10.0% | |
| 2007 | 365 | 10.0% | |
| 2008 | 366 | 10.0% | |
| 2009 | 365 | 10.0% | |
| 2010 | 365 | 10.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2015 | 365 | 10.0% | |
| 2014 | 365 | 10.0% | |
| 2013 | 365 | 10.0% | |
| 2012 | 366 | 10.0% | |
| 2011 | 365 | 10.0% |
Day_in_Month
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.72782037 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 28.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.800529314 |
|---|---|
| Coefficient of variation (CV) | 0.5595517437 |
| Kurtosis | -1.193846831 |
| Mean | 15.72782037 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.006914871984 |
| Sum | 57438 |
| Variance | 77.4493162 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 120 | 3.3% | |
| 28 | 120 | 3.3% | |
| 4 | 120 | 3.3% | |
| 6 | 120 | 3.3% | |
| 8 | 120 | 3.3% | |
| 10 | 120 | 3.3% | |
| 12 | 120 | 3.3% | |
| 14 | 120 | 3.3% | |
| 16 | 120 | 3.3% | |
| 18 | 120 | 3.3% | |
| Other values (21) | 2452 | 67.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 120 | 3.3% | |
| 2 | 120 | 3.3% | |
| 3 | 120 | 3.3% | |
| 4 | 120 | 3.3% | |
| 5 | 120 | 3.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 31 | 70 | 1.9% | |
| 30 | 110 | 3.0% | |
| 29 | 112 | 3.1% | |
| 28 | 120 | 3.3% | |
| 27 | 120 | 3.3% |
Day
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.5 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Sunday | 522 | 14.3% | |
| Monday | 522 | 14.3% | |
| Thursday | 522 | 14.3% | |
| Wednesday | 522 | 14.3% | |
| Tuesday | 522 | 14.3% | |
| Saturday | 521 | 14.3% | |
| Friday | 521 | 14.3% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.142935378 |
| Min length | 6 |
Month
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.5 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| July | 310 | 8.5% | |
| August | 310 | 8.5% | |
| March | 310 | 8.5% | |
| October | 310 | 8.5% | |
| January | 310 | 8.5% | |
| May | 310 | 8.5% | |
| December | 310 | 8.5% | |
| September | 300 | 8.2% | |
| April | 300 | 8.2% | |
| November | 300 | 8.2% | |
| Other values (2) | 582 | 15.9% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.148959474 |
| Min length | 3 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the line does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
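The calculation described above can be sketched in a few lines of standard-library Python. The function name `pearson_r` and the toy data are illustrative assumptions; this is not the code that produced the report's correlation matrices.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sample covariance and sample standard deviations (n - 1 in the denominator).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)

# Toy data: y is an exact linear function of x.
x = [1, 2, 3, 4, 5]
y = [2 * v + 1 for v in x]
# Shifting and rescaling y (with a positive factor) should leave r unchanged.
y_rescaled = [10 * v + 3 for v in y]

r_original = pearson_r(x, y)
r_rescaled = pearson_r(x, y_rescaled)
print(round(r_original, 6), round(r_rescaled, 6))
```

Both calls give a value of 1 up to floating-point rounding, which illustrates the invariance under changes in location and scale.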
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
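As a rough illustration of the rank-based calculation, the sketch below converts each variable to ranks (ties receive the average of their positions, which is an assumption about tie handling) and then applies the Pearson formula to the ranks. The helper names are hypothetical.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(round(rho, 6), round(r, 6))
```

On this toy data ρ is 1 because the ordering is perfectly preserved, while Pearson's r stays below 1 because the relationship is not linear.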
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
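The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows; the function name is illustrative, and statistical libraries commonly default to the tie-adjusted tau-b instead, so results can differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # Tied pairs (direction == 0) count towards neither group.
    return (concordant - discordant) / len(pairs)

# Toy data: one of the six pairs is out of order.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(round(tau, 4))
```

Here five pairs are concordant and one is discordant, so τ = (5 - 1) / 6.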
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for the phik package.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Passengers | Revision | Temperature_Midday | Sunshine_Percentage | Snow_5Days | Temperature_Deviation | Temperature_Max | Temperature_Min | Temperature_Evening | Precipiation_5Days | Precipiation | Wind | Holiday | Special_Event | Year | Day_in_Month | Day | Month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2580 | 0 | 4.9 | 52.0 | 5.0 | 2.3 | 5.1 | -0.8 | 0.9 | 12.1 | 4.6 | 9.9 | 1 | 1 | 2006 | 1 | Sunday | January |
| 1 | 1973 | 0 | 2.4 | 0.0 | 5.0 | 2.3 | 3.4 | 0.6 | 2.7 | 12.4 | 0.3 | 6.1 | 1 | 1 | 2006 | 2 | Monday | January |
| 2 | 1044 | 0 | 2.2 | 11.0 | 5.0 | 1.6 | 2.9 | -1.6 | 1.8 | 12.4 | 0.0 | 5.3 | 0 | 0 | 2006 | 3 | Tuesday | January |
| 3 | 980 | 0 | 0.8 | 0.0 | 0.0 | 0.2 | 1.1 | -1.0 | 0.6 | 10.3 | 0.0 | 6.4 | 0 | 0 | 2006 | 4 | Wednesday | January |
| 4 | 1139 | 0 | -0.3 | 88.0 | 0.0 | -1.8 | 0.9 | -5.4 | -3.2 | 0.3 | 0.0 | 4.5 | 0 | 0 | 2006 | 5 | Thursday | January |
| 5 | 1057 | 0 | -2.7 | 0.0 | 0.0 | -3.2 | -1.8 | -6.7 | -2.9 | 0.3 | 0.0 | 5.1 | 0 | 0 | 2006 | 6 | Friday | January |
| 6 | 914 | 0 | -2.4 | 0.0 | 0.0 | -2.7 | -1.8 | -3.7 | -3.2 | 0.0 | 0.0 | 4.4 | 0 | 0 | 2006 | 7 | Saturday | January |
| 7 | 1581 | 0 | -2.8 | 0.0 | 0.0 | -2.9 | -1.9 | -3.9 | -2.7 | 0.0 | 0.0 | 4.4 | 0 | 0 | 2006 | 8 | Sunday | January |
| 8 | 808 | 0 | -2.7 | 0.0 | 0.0 | -2.9 | -2.3 | -4.0 | -2.9 | 0.0 | 0.0 | 4.6 | 0 | 0 | 2006 | 9 | Monday | January |
| 9 | 792 | 0 | -3.5 | 0.0 | 0.0 | -3.3 | -2.7 | -4.8 | -3.3 | 0.0 | 0.0 | 3.8 | 0 | 0 | 2006 | 10 | Tuesday | January |
Last rows
| | Passengers | Revision | Temperature_Midday | Sunshine_Percentage | Snow_5Days | Temperature_Deviation | Temperature_Max | Temperature_Min | Temperature_Evening | Precipiation_5Days | Precipiation | Wind | Holiday | Special_Event | Year | Day_in_Month | Day | Month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3642 | 0 | 1 | 9.0 | 81.0 | 0.0 | 4.0 | 10.1 | 1.9 | 5.2 | 4.6 | 0.0 | 3.3 | 0 | 0 | 2015 | 22 | Tuesday | December |
| 3643 | 0 | 1 | 9.3 | 85.0 | 0.0 | 3.3 | 10.3 | -0.6 | 5.2 | 4.6 | 0.0 | 4.5 | 0 | 0 | 2015 | 23 | Wednesday | December |
| 3644 | 0 | 1 | 8.7 | 62.0 | 0.0 | 3.1 | 9.4 | 0.5 | 5.1 | 4.6 | 0.0 | 2.9 | 1 | 1 | 2015 | 24 | Thursday | December |
| 3645 | 0 | 1 | 10.2 | 100.0 | 0.0 | 3.3 | 11.3 | 0.3 | 4.1 | 4.6 | 0.0 | 3.2 | 1 | 1 | 2015 | 25 | Friday | December |
| 3646 | 0 | 1 | 7.6 | 100.0 | 0.0 | 1.5 | 9.0 | -0.9 | 2.5 | 0.0 | 0.0 | 2.6 | 1 | 1 | 2015 | 26 | Saturday | December |
| 3647 | 0 | 1 | 8.1 | 99.0 | 0.0 | 1.1 | 9.3 | -1.7 | 2.3 | 0.0 | 0.0 | 2.9 | 0 | 0 | 2015 | 27 | Sunday | December |
| 3648 | 0 | 1 | 1.2 | 0.0 | 0.0 | -1.0 | 2.4 | -2.1 | 0.4 | 0.0 | 0.0 | 2.7 | 0 | 0 | 2015 | 28 | Monday | December |
| 3649 | 0 | 1 | 1.2 | 19.0 | 0.0 | 0.2 | 3.9 | -2.6 | 2.8 | 0.0 | 0.0 | 3.1 | 0 | 0 | 2015 | 29 | Tuesday | December |
| 3650 | 0 | 1 | 2.8 | 0.0 | 0.0 | 1.8 | 4.1 | 1.8 | 2.1 | 0.0 | 0.0 | 4.3 | 0 | 0 | 2015 | 30 | Wednesday | December |
| 3651 | 0 | 1 | 5.1 | 0.0 | 0.0 | 3.0 | 5.7 | 1.7 | 4.8 | 9.2 | 2.8 | 5.3 | 6 | 1 | 2015 | 31 | Thursday | December |